A polynomial algorithm for balanced clustering via graph partitioning
نویسندگان
چکیده
Abstract The objective of clustering is to discover natural groups in datasets and identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics data. problem can be seen as detecting inherent separations between a given point set metric space governed by similarity function. pairwise similarities all data objects form weighted graph whose adjacency matrix contains necessary information for process. Consequently, task formulated partitioning problem. In this context, we propose new cluster quality measure uses ratio intra- inter-cluster variance allows us compute optimal under min-max principle polynomial time. Our algorithm applied both partitional hierarchical clustering.
منابع مشابه
A Polynomial Algorithm for Balanced Clustering via Graph Partitioning
The objective of clustering is to discover natural groups in datasets and to identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics of the data. The problem can be seen as detecting the inherent separations between groups of a given point set in a metric space governed by a similarity function. The pairwise similarities between all ...
متن کاملDoubly Balanced Connected Graph Partitioning
We introduce and study the Doubly Balanced Connected graph Partitioning (DBCP) problem: Let G=(V,E) be a connected graph with a weight (supply/demand) function p:V→ {−1,+1} satisfying p(V )= ∑ j∈V p(j)=0. The objective is to partition G into (V1, V2) such that G[V1] and G[V2] are connected, |p(V1)|, |p(V2)|≤cp, and max{ |V1| |V2| , |V2| |V1|}≤cs, for some constants cp and cs. When G is 2-connec...
متن کاملStreaming Balanced Graph Partitioning for Random Graphs
There has been a recent explosion in the size of stored data, partially due to advances in storage technology, and partially due to the growing popularity of cloud-computing and the vast quantities of data generated. This motivates the need for streaming algorithms that can compute approximate solutions without full random access to all of the data. We model the problem of loading a graph onto ...
متن کاملBootstrap clustering for graph partitioning
Given a simple undirected weighted or unweighted graph, we try to cluster the vertex set into communities and also to quantify the robustness of these clusters. For that task, we propose a new method, called bootstrap clustering which consists in (i) defining a new clustering algorithm for graphs, (ii) building a set of graphs similar to the initial one, (iii) applying the clustering method to ...
متن کاملA graph-based clustering method for a large set of sequences using a graph partitioning algorithm.
A graph-based clustering method is proposed to cluster protein sequences into families, which automatically improves clusters of the conventional single linkage clustering method. Our approach formulates sequence clustering problem as a kind of graph partitioning problem in a weighted linkage graph, which vertices correspond to sequences, edges correspond to higher similarities than given thres...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: European Journal of Operational Research
سال: 2021
ISSN: ['1872-6860', '0377-2217']
DOI: https://doi.org/10.1016/j.ejor.2020.07.031